Keyword data informs and impacts every other SEO task, including finding content topics, on-page SEO, email outreach, and content promotion. While a wealth of companies provide granular keyword data, a birds-eye view of Google Searches is missing in the industry. This large-scale analysis aims to fill the gap.
As a data partner, DataForSEO provided the raw data for the analysis through their keyword database [https://dataforseo.com/databases]. The analysis was limited to US keyword data that accounted for around 306 keyword searches. Keyword monthly searchers with volume of 0 were removed from the data set. The data was queried from a Google BigQuery database.
To attain additional insights from the data, we enriched a random sample of around 2.5 million keywords with the Ahrefs Keyword Explorer. This includes the data on difficulty, return rate, clicks, region volume, and SERP features.
Below an overview of the data:
| Statistic | Value |
|---|---|
| Total number of searches | ~306 million |
| Total volume of searches | ~32 billion |
| Searches with missing volume | 0.51% |
| Mean search volume | 989 |
| Median search volume | 10 |
| Mean CPC | 0.61 |
(For how the volume was calculated, see appendix.)
These are the most popular searches, with volume based on ahref:
| Keyword | Volume |
|---|---|
| youtube | 0.546% |
| 0.530% | |
| amazon | 0.407% |
| gmail | 0.296% |
| 0.271% | |
| weather | 0.164% |
| yahoo | 0.161% |
| ebay | 0.161% |
| walmart | 0.145% |
| yahoo mail | 0.143% |
| netflix | 0.139% |
| google docs | 0.100% |
| translate | 0.098% |
| usps tracking | 0.093% |
| news | 0.091% |
| craigslist | 0.091% |
| fox news | 0.091% |
| cnn | 0.083% |
| calculator | 0.073% |
| hotmail | 0.064% |
| roblox | 0.063% |
| target | 0.063% |
| 0.057% | |
| msn | 0.057% |
| trump | 0.054% |
| 0.054% | |
| bank of america | 0.051% |
| new year | 0.051% |
| maps | 0.050% |
| nfl | 0.044% |
| ups tracking | 0.042% |
| 0.041% | |
| 0.041% | |
| espn | 0.038% |
| disney plus | 0.037% |
| etsy | 0.036% |
| usps | 0.035% |
| finance | 0.033% |
| aol | 0.029% |
| women’s world cup 2019 | 0.026% |
| nba | 0.024% |
| you | 0.023% |
| amazon prime video | 0.022% |
| internet speed test | 0.021% |
| bed bath and beyond | 0.021% |
| ikea | 0.020% |
| dow | 0.018% |
| food near me | 0.018% |
| united airlines | 0.018% |
| speedtest | 0.017% |
The search volume is concentrated among the top searches:
| Searches | Volume |
|---|---|
| Top 50 | 4.9% |
| Top 500 | 8.4% |
| Top 2000 | 12.2% |
| Top 10k | 18.8% |
If we divide the searches into categories based on how much volume they receive, this gives an overview:
These figures together show that low volume searches (individual searches with volume less than 100), constitute 91.8% of all searches, but only 3.3% of all volume. Or stated in another way, 91.8% of keywords get 1-100 volume a month. Whereas very high volume searches (individual searches with volume above 10,000) constitute only 0.2% of all searches, but have 75.2% of all volume.
The top searches have high volume compared to the rest, and the search volume level quickly levels off. The picture looks slightly different depending on whether we use the values from DataforSEO:
Or ahref:
If a misspelling is recognized, a so-called spell type is suggested. There are three types of spell shown in the table below. ~1.4% of searches have a spell type, but those that do have tend to have high volume.
14.1% of searches are in the form of a question. “how” is the most common question word.
A list of stopwords was qcquired from here: https://meta.wikimedia.org/wiki/Stop_word_list/google_stop_word_list#English
“how” and “the” are the most common stopwords, which are present in 6-8% of searches.
A colorful version:
The most searched queries have length 5-10 characters, and falls continuously for search queries longer or shorter than that.
The mean keyword length in characters is 11.2, and the median is 8.5.
Most searches consist of a single word, with volume quickly falling as the number of words increase.
Search terms that are 1-3 words in length get ~10x as much volume on average as search terms that are 5+ words in length.
Internet & Telecom is the keyword category with the highest mean volume.
Arts & Entertainment, Internet & Telecom, and News, Media & Publications have the highest total volume.
Finance has the highest mean cost per click.
The average CPC for all searches is 0.61
Description of keyword difficulty from ahref:
Keyword Difficulty (KD) effectively tells you the average height (backlink profile strength) of the other players in the game (top-ranking pages). The higher the number, the more height (backlinks) you’ll likely to need to stand a chance of ranking.
As volume increases, the difficulty increases.
The slope of the linear regression line is such that for each doubling of the volume, the difficulty increases by 1.63. For example, as the volume goes from 100 to 3200 (6 doublings), the difficulty increases by roughly 1.63 * 6 ~= 10.
Higher difficulty also means higher CPC on average. Note that the Y axis is logarithmic, so a small move on the y axis is a large increase in value.